Multinomial logit models with implicit variable selection

نویسندگان

  • Faisal Maqbool Zahid
  • Gerhard Tutz
چکیده

Multinomial logit models which are most commonly used for the modeling of unordered multi-category responses are typically restricted to the use of few predictors. In the high-dimensional case maximum likelihood estimates frequently do not exist. In this paper we are developing a boosting technique called multinomBoost that performs variable selection and fits the multinomial logit model also when predictors are high-dimensional. Since in multicategory models the effect of one predictor variable is represented by several parameters one has to distinguish between variable selection and parameter selection. A special feature of the approach is that, in contrast to existing approaches, it selects variables not parameters. The method can distinguish between mandatory predictors and optional predictors. Moreover, it adapts to metric, binary, nominal and ordinal predictors. Regularization within the algorithm allows to include nominal and ordinal variables which have many categories. In the case of ordinal predictors the order information is used. The performance of boosting technique with respect to mean squared error, prediction error and the identification of relevant variables is investigated in a simulation study. For two real life data sets the results are also compared with the Lasso approach which selects parameters.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Selection of multinomial logit models via association rules analysis

In this research, we propose a novel approach for a multinomial logit model selection procedure: specifically, we apply association rules analysis to identifying potential interactions for multinomial logit modeling. Interaction effects are very common in reality, but conventional multinomial logit model selection methods typically ignore them. This is especially true for higher-order interacti...

متن کامل

Working Paper Series Categorical Data Categorical Data

Categorical outcome (or discrete outcome or qualitative response) regression models are models for a discrete dependent variable recording in which of two or more categories an outcome of interest lies. For binary data (two categories) probit and logit models or semiparametric methods are used. For multinomial data (more than two categories) that are unordered, common models are multinomial and...

متن کامل

Modeling the behavior of disordered taxi drivers of Tehran for choosing passenger and destination

In this study, the manner of private taxis drivers has been investigated for choosing passenger and destination from a fixed point. Therefore, two models called Multinomial and Nested Logit Models have been utilized. The information gained by scrolling in 2016 is the input data, which are in the format of revealed preference, acquired by the verbal interview in Vanak Square in Tehran (Iran). Ba...

متن کامل

Variable selection in general multinomial logit models

The use of the multinomial logit model is typically restricted to applications with few predictors, because in high-dimensional settings maximum likelihood estimates tend to deteriorate. In this paper we are proposing a sparsity-inducing penalty that accounts for the special structure of multinomial models. In contrast to existing methods, it penalizes the parameters that are linked to one vari...

متن کامل

Multinomial Logistic Regression Ensembles

This article proposes a method for multiclass classification problems using ensembles of multinomial logistic regression models. A multinomial logit model is used as a base classifier in ensembles from random partitions of predictors. The multinomial logit model can be applied to each mutually exclusive subset of the feature space without variable selection. By combining multiple models the pro...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Adv. Data Analysis and Classification

دوره 7  شماره 

صفحات  -

تاریخ انتشار 2013